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Abstract 

Fluorination has been widely used in chennical synthesis, but is rare in nature. 
The only known biological fluorination scope is represented by the fl pathway 
fronn Streptomyces cattleya that produces fluoroacetate (FAc) and 

4- fluorothreonine (4-FT). Here we report the identification of a novel pathway 
for FAc and 4-FT biosynthesis fronn the actinonnycetonna-causing pathogen 
Nocardia brasiliensis ATCC 700358. The new pathway shares overall 
conservation with the fl pathway in S. cattleya. Biochennical characterization of 
the conserved donnains revealed a novel fluorinase NobA that can 
biosynthesize 5'-fluoro-5'-deoxyadenosine (5'-FDA) fronn inorganic fluoride and 

5- adenosyl-l-nnethionine (SAM). The NobA shows sinnilar halide specificity and 
characteristics to the fluorination enzynne FIA of the fl pathway. Kinetic 
paranneters for fluoride (/C^4153 |iM, /c^^^ 0.073 pnin-"") and SAM (K^4^6 [iM, k 
^^^0.139 nnin""') have been deternnined, revealing that NobA is slightly (2.3 fold) 
slower than FIA. Upon sequence connparison, we finally identified a distinct 
loop region in the fluorinases that probably accounts for the disparity of 
fluorination activity. 
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Introduction 

The introduction of fluorine into organic molecules can often 
improve their molecular stability and pharmacological properties^ -. 
Organo-fluorines have been widely used in pharmaceuticals, diag- 
nostics, agrochemicals, and materials^'^, and it is estimated that 
20-30% of commercial drugs, including many top-sellers, contain 
fluorine^. The increasing prevalence and success of organo-fluorines 
have instigated enormous efforts over the past decades in develop- 
ing methodologies for efficiently introducing fluorine into organic 
molecules^'^', however the unique properties of fluorine make these 
chemical incorporations challenging, usually needing harsh syn- 
thetic conditions and giving rise to moderate chemo- or stereo- 
selectivity^'^. 

In contrast to chemical synthesis, biocatalytic synthesis or biosyn- 
thesis provide better efficacy and selectivity under mild conditions, 
and can be easily scaled up for industrial production^. Although nat- 
ural fluorination is rare, efforts in identifying new bio-fluorination 
machineries are continuing. Pioneered by O'Hagan and co-workers, 
the natural bio-fluorination pathway to produce fluoroacetate (FAc) 
and 4-fluorothreonine (4-FT) was biochemically characterized in 
the actinomycete Streptomyces cattleya^. Enzymatic fluorination in 
S. cattleya occurs via a fluorine-fixation step that is catalyzed by 
the unique fluorinase enzyme FIA through the nucleophilic attack 



of fluoride to SAM'^'^^ The resulting fluoro-intermediate 5'-fluoro- 
5 '-deoxy adenosine (5'-FDA) is further converted by five enzymes 
to give rise to the FAc and 4-FT (Figure lA and Bf. This power- 
ful bio-fluorination machinery has been proved very successful in 
incorporating both ^^F and ^^F into nucleotides ^-"^^ and in produc- 
ing the anticancer drug candidate fluorosalinosporamide^^. Very 
recently, its enormous potential for producing complex fluorinated 
molecules was demonstrated by Chang and co-workers 'I Using 
an engaging biosynthetic strategy they were able to achieve site- 
specific incorporation of FAc into the polyketide backbone of 
2-desmethyltriketide lactone. Merging this strategy with the FAC 
producing pathway, such as the fl pathway in vivo, could present 
opportunities to use living cells for the production of acetate- 
derived fluorinated natural products that include polyketides, fatty 
acids, terpenoids and steroids. 

Synthetic biology approaches need pools of "genetic elements" 
to mix and match to build and optimize synthetic bio-pathways^'^. 
However, the only natural bio-fluorination tool available so far is the 
fl pathway. The limited natural bio-fluorination machinery forms 
the bottleneck for development of efficient synthetic bio-pathway 
to produce fluorinated natural products, thus an expansion of the 
genetic resources of biological fluorination is highly desired. Here, 
we report the identification of a latent fluorination pathway for 
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Figure 1. Biosynthetic genes and pathways for FAc and 4-FT biosynthesis. (A) Organization of relevant genes in the //and nob pathway, 
encoding the conversion of fluoride to FAc and 4-FT (crosshatched), and auxiliary function (blackened). Iso: isomerase; Deh: dehydrogenase 
and Tran: transaldolase. Identical genes are indicated by dotted lines. (B) Proposed biosynthetic pathways to the FAc and 4-FT in the 
S. cattleya and N. brasiliensis. 
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FAc and 4-FT biosynthesis from the pathogen Nocardia brasiliensis 
ATCC 700358 and the biochemical characterization of a novel 
SAM-dependent fluorinase enzyme. 

Materials and methods 

General methods, biochemicals, and chemicals 

DNA isolation and manipulation in Escherichia coli were per- 
formed according to standard methods-^^'^^ PCR amplifications 
were carried out on an authorized thermal cycler (Veriti 96 Well, 
ABI) using PrimeSTAR Max DNA polymerase according to the 
manufacturer protocol (TaKaRa, Japan). Primer syntheses and 
DNA sequencing were performed at Genewiz BiotechCo., Ltd. 
(China). The E. coli DH5(2 cells were purchased from Invitrogen 
(Carlsbad, CA), and E. coli BL21 (DE^) cells were purchased from 
Novagen (Madison). A^. brasiliensis HUJEG-1 (ATCC 700358) was 
purchased from the American Type Culture Collection (ATCC), 
and S. cattleya DSM 46488 was provided by Prof. Hong-Yu Ou. 
Restriction enzymes were purchased from TaKaRa Biotechnology 
Co., Ltd. (Dalian, China). All other chemicals and reagents were 
purchased from Santa Cruz Biotechnology, Inc (USA) or Shanghai 
Sangon Biotech (China) Co., Ltd., unless noted otherwise. Analyti- 
cal HPLC was carried out on an SHIMADZU LC-20A Prominence 
HPLC system. LC-MS analysis was conducted on a Thermo Instru- 
ments HPLC system connected to a LCQ Fleet electrospray ioniza- 
tion (ESI) mass spectrometer (ThermoFisher Scientific Inc.). NMR 
data were collected using a Bruker 400 MHz spectrometer. 

Sequence analysis. The genetic open reading frame sequences 
(ORFs) in A^. brasiliensis were identified using the FramePlot 4.0 
beta program (http://nocardia.nih.go.jp/fp4/). The corresponding 
proteins were compared with other known proteins in the NCBI 
databases by using available BLAST methods (http://www.ncbi.nlm. 
nih.gov/blast/). Amino acid sequence alignments were performed by 
using the Strap program (http://www.bioinformatics.org/strap/). 

Cloning, overexpression and purification of NobA. The synthetic 
gene codon-optimized nobA for expression in E. coli using the 
OptimumGene™ algorithm was sourced from GenScript (Nanjing, 
China). The optimized sequence exhibited 82% identity to the 
original sequence and is shown below (codons optimized are 
underlined, restriction sites are in italics): GAArrCCArAT- 
GACGACGACGAATGGCCGCCGCCCGATTATTGCGTTI 
ATGAGCGAICTGGGTATCACCGACGACTCCGTGGCACA- 
GTGIAAGGGTCTGATGCTGAGCGTGTGCCCGGATGTGAC- 
GATTGTTGACATCTGTCATACCATGCAGCCGTGGGATGTG- 
GAAGAAGGTGCGCGITATATTGTTGACCTGCCGCGCCT- 
GTTTCCGGAGGGIACGGTTTTCGCAACCACGACCTAC- 
CCGGCAACCGGTACGACCGCACG TAGC GTCGCTCT- 
GCGTATCGCACATGCCTCTAAAGGCGGIGCACGIGGCz 
CAGTGGGCAGG TAGT GGTGCAGGTTTCGAACGCAA- 
GGAAGGCTCATATATTTACATCGCGCCGAACAATGGTCT- 
GCTGACGACCGTTATTAAAGAACACGGCTATCTGGAAGC- 
CTACGAAGTTAGCTCTCCGGAAGTCATCCCGGAACAAC- 
CGGAACCGACCTTTT ATTCA CGTGAAATGGTGGCACTGC- 
CGTCGGCICATCTGGCAGCAGGITTCCCGCTGGAAAAAz 
GTCGGTCGTCGCCTGGCAGAIGACGAAATTGTGCGTTTT: 
GAACGCAAGGATCCGGAACTGGTTGCCGATCACGAC- 
CTGGTCGGTTATGTGACCAACATTGATCATCCGTTTG- 
GCAACGTTTGGACGAATATCCACCGTACCGACCTG- 



GAAAAACTGGGIGTCGGCTACGGTACGAAGCTGCGCAT- 
TACCCTGGATGGTGTGCTGCCGTTTGAACTGCCGCTGTC- 
CCCGACCTTCGCAGATGCTGGCGAAATCGGTGCAGCIz 
GTGGCATATCTGAGTTCCCGTGGTTACCTGGCACTG- 
GCACGTAATGCTGCG TCG CTGGCGTATCCGTATAATCT- 
GAAGGCGGGTAT TTCG GTCCAAGTCAAAGTGGGCTAA- 
GCTT. The gene of FIA was PCR amplified from genomic DNA 
of S. cattleya using the forward primer 5 '-TTC ATATGGCTGC- 
GAACAGCACACGTC-3' and reverse primer 5'-TTAAGCT- 
TATCAGCGGGCCTCGACCCG-3'. The purified PCR product was 
ligated to pMD18-T simple (TaKaRa, Japan) following the manu- 
facturer protocol and its fidelity was confirmed by DNA sequencing 
(multi-color fluorescence-based DNA analysis). The Ndel-Hin^Hl 
fragments of nobA and^A were ligated to pET28a to yield the plas- 
mids pWHU2401 and pWHU2402, respectively, which were then 
been used to overexpress the NobA and FIA proteins as N-terminal 
6xHis-tagged fusion proteins. The resulting expression plasmids 
were transformed into E. coli BL21 (DE^) cells. The cells were grown 
in 0.8 L of liquid culture (IxLuria broth medium with 50 |ig/mL 
kanamycin) at 37°C to an OD^^^ of 0.6. The cells were allowed 
to cool room temperature and induced with 0.1 mM isopropyl-y^- 
D-thiogalactopyranoside (IPTG) for 10 hours at 25°C. The cells 
were centrifuged (6000 rpm, 10 minutes, 4°C) resuspended in 
15 mL lysis buffer (25 mM HEPES pH 7.5, 300 mM NaCl, 5 mM 
imidazole, 10% glycerol) and lysed by sonication. Cellular debris 
was removed by centrifugation (15000 rpm, 30 minutes, 4°C). Two 
mL Ni-NTA agarose resin was added to the supernatant and the 
solutions were shaken at 4°C for 1 hour. The protein resin mixtures 
were loaded into a gravity flow column, and proteins were sequen- 
tially eluted with 10 mL Buffer A (25 mM HEPES pH 7.5, 300 mM 
NaCl, 10% glycerol) supplemented with 50 mM, 100 mM and 
300 mM imidazole, respectively. Purified proteins (10 mL) were 
buffer exchanged by dialysis in 1 L Buffer B (25 mM HEPES, pH 
7.5, 50 mM NaCl, 10% glycerol) and concentrated by centrifugation 
using an Amicon Ultra-4 (10 KDa, GE Healthcare). Proteins purity 
(90-95%) was evaluated by 12% acrylamide SDS-PAGE. Protein 
concentrations of the NobA (5 mg mL"0, FIA (4 mg mL"0 and NobA- 
S158A (25 mg mL"0 were determined by the Bradford method using 
a BSA calibration curve. The final proteins were flash-frozen in 
liquid nitrogen and stored at -80°C. Gel filtration chromatography 
was used to determine the native molecular mass of NobA. Experi- 
ments were performed at a flow rate of 1 ml min'^ using an AKTA 
Purifier FPLC system (Amersham Pharmacia Biotech) and a Super- 
dex 200 GL column (Amersham Pharmacia Biotech). The elution 
buffer used was 50 mM phosphate buffer (pH 7.8). The native 
molecular mass of the enzyme was estimated from a calibration curve 
plotted by using the standard proteins carbonic anhydrase (29 kDa), 
bovine serum albumin (66 kDa), alcohol dehydrogenase (150 kDa), 
y^-amylase (200 kDa), apoferritin (443 kDa), and thyroglobulin 
(669 kDa). 

In vitro enzymatic assays of NobA. Typical assays were performed 
in 100 jiL of 20 mM sodium phosphate buffer (pH 7.5) containing 
1 mM SAM, 1-5 |iM enzyme and 20 mM NaF or NaCPl For chlo- 
rination reaction, additional 1 |iM L-amino acid oxidase (Sigma- 
Aldrich Co., Ltd.) was added in the reaction mixture. Reactions 
were incubated at 26°C, quenched by 10 |iL 100% trichloro- 
acetic acid (TCA) and the supernatants were subjected to HPLC or 
LC-HRMS analysis using an analytic Inertsil ODS-3 column 
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(5 |im, 4.6x250 mm, GL Science Inc). HPLC analysis was normally 
performed with a linear gradient of 5% to 20% CH3CN (v/v) over 
20 minutes, 20% to 5% CH3CN (v/v) over 1 minute, and 5% CH3CN 
(v/v) for further 4 minutes at a flow rate of 1 mL/min under 260 nm. 
HPLC-ESI-MS analysis was performed as described above. The 
kinetic parameters for NaF were determined with SAM maintained 
at a concentration of 0.4 mM and NaF at increasing concentrations 
from 0 to 20 mM. The kinetic parameters for SAM were deter- 
mined with a concentration of NaF maintained at 5 mM and SAM at 
increasing concentrations from 0 to 1 mM. The formation of 5'-FDA 
was determined by HPLC and quantified using a standard curve 
of ^S-adenosyl-L-homocysteine (SAH). Each data point represents 
a minimum of three replicate, end point assays were fitted to the 
Michaelis-Menten equation by OriginPro 9.0 (OriginLab software, 
Northampton, MA) to obtain estimates for k^^^ and K^. 

Determination pH and metal ion effect on the activity of NobA 
and FIA. A reaction mixture (100 \A) containing 20 mM NaF, 1 mM 
SAM in 20 mM sodium acetate (pH 4.5-5.5), sodium phosphate 
(pH 6.0-7.0), Tris-HCl (pH 7.5-9.0), or Gly-sodium hydroxide 
(pH 9.5-10.0) buffer was prepared. The reactions were initiated by 
the addition of 5 |iM NobA or FIA, incubated at 26°C for 30 minutes 
where the velocity is in the linear range, and then subjected to 
HPLC analysis. To measure the metal ion effect on the activ- 
ity, a group of 100 jil reaction mixtures containing 20 mM NaF, 
1 mM SAM in 20 mM Tris-HCl (pH 6.5) buffer were spiked with 
1 mM different divalent metal chloride salts (Mg^^, Mn^^, Fe^^, 
Cu^^ or Zn^+) or 1 mM ethylenediaminetetraacetic acid (EDTA) to 
remove the potential associated metal ions of the reaction mixture. 
Reactions were initiated by addition of 5 jiM NobA or FIA, and 



then incubated at 26°C for 60 min to HPLC analysis. Neither metal 
ions nor EDTA was added in the negative control reaction mixture. 
The formation of 5'-FDA was determined by HPLC and quanti- 
fied using a standard curve of S-adenosyl-L-homocysteine (SAH). 
Each data point represents a minimum of two replicate, end point 
assays were fitted to the Excel (Microsoft Corporation) to obtain 
scatter diagram (PH) and histogram (metal ions) for estimating 
relative activity. 

Construction and purification of nobA mutant. A nobA mutant 
was constructed using a standard PCR method-^ and the pWHU2401 
as a template. The NobA-S158A mutant was constructed with 
the primer pair 5'-GACCTTTTATGCACGTGAAATGG-3' and 
5'-CCATTTCACGTGCATAAAAGGTC-3' according to a stand- 
ard protocoP^ The construct of pWHU2403 was verified by DNA 
sequencing and was overexpressed in E. coli BL21 (DE^) as pre- 
viously described. The expression and purification procedures of 
recombinant NobA-S158A were carried out in a manner similar to 
that described for NobA. 

Results and discussion 

A putative gene {nobA) encoding a protein with overall 79% identity 
to the fluorinase RA was identified from the genome of N. brasiliensis 
ATCC 70035823 (Figure lA). By extending the searching region 
to the up-downstream of nobA, we further identified a gene cluster 
of 10 genes (nobA-l, Figure lA and Table 1). In this gene cluster, 
four catalytic genes (nobA, B, D and M) encode the homologues 
of 5'-FDA synthase, 5'-FDA phosphorylase, 5'-FDRP isomerase 
and 4-FT transaldolase which are responsible for the conversion 
of fluoride to 5-FDRibulP and fluoroacetaldehyde to 4-FT in the 



Table 1 . Putative orfs in the nob biosynthetic pathway from N. brasiliensis (accession number KF963271). 



Gene 


Size 


Protein homologue and origin 


Identity/similarity (%) 


Proposed function 


G 


223 


fIG (CAJ20008), from S. cattleya 
DSM 46488 


39/60 


DNA binding protein 
(regulatory) 


C 


340 


SCAT_p0565 (YP_004919857), from 
S. cattleya DSM 46488 


69/79 


Permease 


F 


191 


f/F (YP 004913663), from S. cattleya 
DSM 46488 


59/73 


DNA binding protein 
(regulatory) 


M 


659 


4-FTase (YP_006051324), from 
S. cattleya DSM 46488 


62/74 


4-fluorothreonine 
transaldolase 


D 


334 


5-FDRP isomerase (YP_006053901 ), 
from S. cattleya DSM 46488 


45/56 


5-FDRP isomerase 


E 


191 


adenine phosphoribosyltransferase 
(WP_0 16574607), from S. albulus 


57/66 


adenine phosphoribosyl 
transferase 


B 


292 


f/e(CAJ20005), from S. cattleya 
DSM 46488 


57/70 


5'-fluoro-5'-deoxy- 
adenosine phosphorylase 


1 


502 


fll (YP 004913660), from S. cattleya 
DSM 46488 


74/85 


S-adenosyl-L- 
homocysteine hydrolase 


A 


300 


flA (YP 004913664), from S. cattleya 
DSM 46488 


81/89 


5'-fluoro-5'- 

deoxyadenosine synthase 


H 


476 


flH (YP 004913661), from S. cattleya 
DSM 46488 


47/64 


NaVH+ antiporter 


J 


507 


Aldehyde dehydrogenase 

(YP 004910482.1), from S. cattleya 

DSM 46488 


78/87 


Aldehyde dehydrogenase 
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fl pathway) I Flanked those are genes for encoding auxihary func- 
tions, including regulation (nobC, F and G), transportation {nobC, 
H) and SAM recycling (nobE and /), among of which four {nobF, 
G, / and H) are homologous to the counterparts in the^ biosynthetic 
pathway. The overall identity to the ^pathway genes suggested that 
the nobK cluster might be related to the 4-FT and FAc biosynthesis. 



not idenfied in S. cattleya shared sequence homology to the aldo- 
lase genes (YP_0068 10507.1 and YP_0068 12798.1 respectively) 
of A^. brasiliensis. Their homology (both have 62% identity) is in 
line with the average identity (58%) between the counterpart genes 
of nob and fl pathway, suggesting that these genes are probably 
involved in the FAc biosynthetic pathway. 



Besides the four catalytic genes described above, other two genes 
encoding an aldolase and a dehydrogenase to mediate the conver- 
sion of 5-FDRibulP to fluoroacetaldehyde and fluoroacetaldehyde 
to FAc are necessary. A homology search for the fluoroacetalde- 
hyde dehydrogenase in the A^. brasiliensis resulted in the identi- 
fication of a gene with 78% identity {nobJ, YP_006807765.1; 
Figure lA and Table 1) from a remote site in the genome, sug- 
gesting that A^. brasiliensis has the potential to produce FAc. 
Unlike the others, the gene encoding aldolases in the fl pathway 
is still elusive. Recently, four aldolase genes were identified from 
S. cattleya by genome sequencing and two of them were thought 
to encode the conversion of 5-FDRibulP to fluoroacetaldehyde^^. 
However, in the A^. brasiliensis genome we could not find iden- 
tical homologues by BLAST searching. Because this reaction 
can also be catalised by fuculose aldolase^ enzyme, we searched 
for fuculose aldolase homologues, with no success. Instead, five 
putative aldolase genes (YP_0068 10507.1, YP_0068 12798.1, 
YP_006805383.1, YP_0068 12725.1 and YP_006809408.1) were 
identified in A^. brasiliensis based on the gene annotation in IMG 
database (https://img.jgi.doe.gOv/cgi-bin/w/main.cgi). We then 
compared the genetic sequences of the putative aldolases to their 
homologues in S. cattleya. Interestingly, two of these aldolase genes 
(YP_004910624 and YP_0049 19742.1) which were previously 



Previous studies revealed the thioesterase RK confers self-immunity 
to the FAc in the fl pathway^^ -^'. Inactivation of the flK gene does 
not affect cell growth in 2 M NaF^^, however can prevent produc- 
tion of fluorometabolites-^. Interestingly, we were not able to find 
the^^homologue in A^. brasiliensis, although there are a few genes 
encoding thioesterases. To test the ability of A^. brasiliensis to pro- 
duce fluorometabolites, the bacteria were subjected to fermentation 
with a supplement of 2 mM NaF as a fluorine source^^ Despite the 
N. brasiliensis grows normally, no new fluorine signal in the culture 
except fluoride could be detected by F-NMR (data not shown). The 
lack of fluorometabolites might be caused by improper fermenta- 
tion conditions or deficiency of the FlK homologue in the genome. 
In vivo gene manipulations including promoter activation and het- 
erologous complementation of the flK in A^. brasiliensis were also 
attempted, but failed due to the inaccessibility of genetic system. 

Since the in vivo studies were inaccessible, we turned to in vitro 
characterization. The 5'-FDA synthase which catalyzes the unique 
fluorine-fixation reaction is one of the most interesting enzymes 
for the communities of enzymology and synthetic biology, thus we 
focused on the biochemical characterization of the NobA enzyme. 
The nobA gene was first codon optimized, cloned as an A^-terminal 
6xHis tagged protein, and then expressed and purified from 
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Figure 2. Fluorinase catalyzed conversion of fluoride and SAM to 5'-FDA and L-methionine. A) SDS-PAGE analysis of purified 
recombinant proteins. Tlie expected sizes of NobA-S158A (34.7 kDa), NobA (34.7 kDa) and FIA (34.6 kDa) are indicated. Lane M, protein 
molecular weight standards. B) HPLC showing NobA-catalyzed production of 5'-FDA with various control reactions. Adenine and MTA 
(S-methyl-5'-thioadenosine) are from the degradation of SAM, and X is an impurity associated with NobA. C) HRMS confirming the identity of 
5'-FDA. The observed value (m/z [M+H]+ 270.1002) is consistent with the calculated value (m/z [M+H]+ 270.0996). 
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E. coli (Figure 2A). In our experiment, the expression yield of 
NobA (7.9 mg/L) was lower than the yield of FIA (9.2 mg/L), and 
like FIA it natively occurs as a hexamer (Figure 3)"\ The fluorina- 
tion activity of the recombinant protein was assessed in parallel with 
FIA which was utilized to produce the 5'-FDA as a control. After the 
reaction, the assay mixtures were quenched by 10% trichloroacetic 
acid (TCA) and subjected to HPLC analysis. A product strictly 
dependent on the addition of SAM, NaF and NobA was detected, 
and its identity was finally confirmed to be 5'-FDA by both 
HRMS and comparison to the product of FIA (Figure 2B and C). 
These results, taken together, indicate that NobA, like FIA, can 



confer fluorine fixation, and that the nob pathway might be related 
to FAc and 4-FT biosynthesis. 

With a new fluorinase in hand, we next compared the enzymatic 
properties of the protein. It has been previously reported that in 
the presence of L-amino acid oxidase (L-AAO), FIA can uti- 
lize CI" but not Br or \ to form the 5 '-Chloro-5'-deoxy adenosine 
(5'-ClDA)-l Similarly, we observed that NobA was also able 
to perform this reaction with a similar reactivity (Figure 4). The 
optimal pH of NobA was determined as 6.5, slightly lower than RA 
at 7.0 (Figure 5A). NobA shows no obvious effects on fluorination 
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Figure 3. Determination of the native molecular mass of NobA by gel filtration chromatography. Standard proteins (a-f) used are 
the following: thyroglobulin (669 kDa), apoferritin (443 kDa), y^-amylase (200 kDa), alcohol dehydrogenase (150 kDa), bovine serum 
albumin (66 kDa) and carbonic anhydrase (29 kDa). NobA has similar molecular weight to y^-amylase (200 kDa) suggesting it naturally 
presence as a hexamer. 
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Figure 4. Fluorinase catalyzed conversion of chloride and SAM to 5'-CIDA and L-methionine in the presence of L-amino acid 
oxidase. The production of 5'-CIDA was determined by HPLC and HRMS. The observed values (m/z [M+H]+ ^^C\ = 286.0692, ^^Cl = 
288.0662) are consistent with the calculated values (m/z [M+H]+ ^'>C\ = 286.0707, ^^ci = 288.0677). 
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Figure 5. pH and metal ions effect, and kinetic analysis of the fluorination reaction catalyzed by NobA (■) and FIA (•). (A) pH effect 
on NobA and FIA activity. The optimum activity of NobA at pH 6.5 and FIA at pH 7.0 were individually set as 100%. (B) Metal ion effect 
on NobA activity. The activity of negative control was set as 100%. (C) Kinetic analysis of the fluorination reaction. When the concentration of 
NaF was varied, the SAM was fixed at 0.4 mM, otherwise when the SAM was varied, the NaF was fixed at 5 mM. 



reactivity in the presence of metal ions, instead 1 mM Cu^^ or Zn^+ 
can severely inhibit its activity (Figure 5B). The activity can be 
slightly increased by adding EDTA, suggesting the fluorination 
reaction is metal independent. The optimal temperature for fluori- 
nation reactivity is 37°C, however at this temperature SAM degra- 
dation is also expedited. In contrast, as 26°C can provide acceptable 
activity and slow down the degradation of SAM, this temperature 
was chosen throughout our kinetic studies. Although previous FIA 
kinetic studies have been reported^"'^*^, the data are inconsistent. 
Thereby, to precisely compare their reactivities, both NobA and 
FIA were kinetically measured in this study (Figure 5C). The 
values of NobA were 4153 |iM for NaF and 416 |aM for SAM, about 
twice the values of FIA (2167 |aM for NaF and 210 |aM for SAM). 
The turnover numbers {k^J of NobA were 0.073 min"^ for NaF 
and 0.139 min"^ for SAM, which were similar to the values of FIA 
(0.11 min-i for NaF and 0.123 min'^ for SAM). The average kJK^ 
of FIA (5.07x10-5 |iM-^ min'^ for NaF and 5.86x10-^ \xM ' min ^ 
outnumbers 2.3 fold to the values of NobA (1.75x10 ^ |aM^ min^ 
for NaF and 3.3x10"^ laM'^ min-^ for SAM), and suggests that NobA 
is less efficient than FIA. 

Similar to RA, NobA retains the conservation of the unique 23-residue 
loop, which is related to site architecture and fluoride selectivity, 
and lacks the critical residues for SAM hydrolysis (Figure Sl)'^°''^^ 



This unique feature confirms that fluorinases form a distinct group 
of enzymes which differ from other SAM-binding members in the 
DUF62 enzyme family. The overall identity of the fluorinases is 
very high (79% identity), and only a region between the R192 to 
L202 (FIA residue numbering) is distinct (Figure S2). Interestingly, 
in the crystal structure of FIA, this region constitutes a loop which 
links the N- and C-terminal domains to form an interface where 
the SAM and fluoride are bound^^ The variety of the loop region 
can probably influence the domain interaction and binding affini- 
ties of the substrates, and this might be one of the reasons for the 
retardation observed for NobA. Moreover, the residue S158 of FIA 
plays critical role in fluoride binding and desolvation"^'^^. Interest- 
ingly, this residue also exists in the NobA (Figure S2). To confirm 
the essentiality of S158 for NobA activity, this residue was further 
mutated into an alanine (Figure 2A). The resulting recombinant pro- 
tein NobA-S158A completely lost fluorination activity (Figure 2B) 
and demonstrates it is essential for NobA activity. This observation 
also indicates that NobA shares an identical catalytic mechanism 
employed by NobA. 

Actinomycetoma is a zoonotically chronic, granulomatous and sub- 
cutaneous tissue infection caused by actinomycetes that may often 
lead to amputation or death^l A^. brasiliensis is the major causa- 
tive pathogen of the actinomycetoma infections in Mexico-^' '\ Its 
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pathogenic mechanism currently is unclear. FAc is a highly toxic 
poison (LD^Q = 0.1 mg/kg mouse » 1.2 laM)^"^. In the human body it 
is first converted into the anabolite (-)-erythro-fluorocitrate (EFc), 
which exerts its toxicity by irreversibly inhibiting aconitase to cause 
cell death (IC^^ = 0.01 |aM)^^'^^\ Interestingly, our research revealed 
the A^. brasiliensis has the genetic potential to produce FAc from 
fluoride. In human blood, the fluoride concentration is normally 
around 0.13 |iM and can increase to 3.2 joM in some fluorine-polluted 
areas^^, which indicates A^. brasiliensis can potentially produce FAc 
at levels that cause severe cytotoxicity and lead to development of 
the actinomycetoma. 

In summary, we identified a latent pathway for FAc and 4-FT bio- 
synthesis from the actinomycetoma pathogen A^. brasiliensis ATCC 
700358. Comparative analysis of the genomes of A^. brasiliensis 
and S. cattleya revealed two common aldolase genes that were 
putatively involved in the FAc and 4-FT biosynthesis. By in vitro 
characterization, we confirmed a novel fluorinase, NobA, that can 
biosynthesize 5'-FDA from inorganic fluoride and SAM. This new 
fluorinase has similar substrate selectivity and characteristics to 
the homologue FIA, with a slightly less efficiency of reaction (2.3 
fold). The lower reactivity might be attributed to the presence of a 
distinct loop region in the sequences of FIA and NobA, based on 
comparative sequence analysis. During the revision of this manu- 
script, O'Hagan and co-workers published the identification of 
the same fluorinase and two other fluorinases from S. sp MA37, 
Actinoplanes sp N902-109 'l These fluorinases are highly conserved 
(76-79% identity to each other) but all have the distinct loop region 
which is identified by this study (Figure S3). This feature indicates 
that fluorinases are a unique protein family whose evolution is also 
unusual, majorly concentrated at the interface of N- and C- domains. 
Moreover, by searching the homologue of the putative S. cattleya 
5-FDRibulP aldolases (YP_004910624 andYP_004919742.1, iden- 
tified in this study) in the geome of A. sp N902-109 results identi- 
fication of an aldolase (YP_007950 157.1). It shares 65% identity 
to the aldolase from S. cattleya (YP_0049 10624) and A^. brasiliensis 



(YP_0068 10507.1) indicating that this common homologue is 
likely responsible for the conversion of 5-FDRibulP. The discovery 
of new fluorinases and biosynthetic pathways increases the genetic 
resource of bio-fluorination and will benefit the future development 
of synthetic bio-pathways to produce fluorinated natural products. 

Data availability 

Data sets showing the fluorinase catalysed conversions of 
fluo-ride and chloride to 5'-FDA and 5'-ClDA are publicly 
available in ZENODO. 

ZENODO: Conversion of fluoride and chloride catalysed by 
SAM-dependent fluorinase in Nocardia brasiliensis, doi: 
10.528 l/zeno-do.8339'l 
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Supplementary materials 



FIA iVAANSTRRPIIAFMSDLGTTDDSVAQCKGLMYSICPDVTVVDVCHSMTPWDVEEGARYIVDLPRFFPEGT 

NobA IMTTTNGRRPIIAFMSDLGITDDSVAQCKGLMLSVCPDVTIVDICHTMQPWDVEEGARYIVDLPRLFPEGT 

SalL 1 MQHNLIAFLSDVGSADEAHALCKGVMYGVAPAATIVDITHDVAPFDVREGALFLADVPHSFPAHT 

PhDUF62 1 MITLTTDFGLKGPYVGEMKVAMLRINPNAKIVDVTHSVTRHSILEGSFVMEQVVKYSPKGT 

SaDUF62 1 MASTPWISFTTDYGLADGFVAACHGVLARLAPTARVIDVTHLVPPGDVRRGAAVLAQTVPYLP . AA 

94 116 
FIA yiVFATTTYPATGTTTRSVAVRIKQAAKGGARGQWAGSGAGFERAEgSYIYIAPNNGLLTT. . .VLEEHGYL 

NobA 7lVFATTTYPATGTTARSVALRIA ^ S | C qf^GQWjU3Sr^^f gfBR f ^gJ-^VTYTAPNM^^ . .VIKEHGYL 

SalL 66VICAYVYPETGTATHTIAVRNE KGQ LLVGPNNGLLSF . . .ALDASPAV 

PhDUF62 62VHVGVIDPGVGTERRAIVIEG DQYLVVPDNGLATLPLKHIKVKSVY 

SaDUF62 66VHVAVVDPGVGTARRAIALTAG NGLLVGPDNGLLLD . .AATALGGVD 

FIA 138EAYEVTSPKVIPEQPEPTFYSREMVAIPSAHLAAGFPLSEVGRPLEDHEIVRFNRPAVEQDGE .ALVGVV 

NobA 138EAYEVSSPEVIPEQPEPTFYSREMVALPSAHLAAGFPLEKVGRRLADDEIVRFERKDPELVADHDLVGYV 
SalL lllECHEVLSPDVMNQPVTPTWYGKDIVAACAAHLAAGTDLAAVGPRIDPKQIVRLPYASASEVEG.GIRGEV 
PhDUF62l08EIIPDKIRKFTGWEISSTFHGRDIFGPAGALIEKGIHPEEFGREIPVDSIVKLN.VEPRK.EGDVWILKV 
SaDUF62lllAAVELTNPDWLGARMSATFHGRDVFAPVAARLALGAPLADAGPAVEPGALVRLP.TPLVQPETDGFTAEV 

FIA 207SAIDHPFGNVWTNIHRTDLEKAGIGYGARLRLTLDGVLP . . FEAPLTPTFADAGEIGNIAI YLNSRGYLS 
NobA 208TNIDHPFGNVWTNIHRTDLEKLGVGYGTKLRITLDGVLP . . FELPLSPTFADAGEIGAAVAYLSSRGYLA 
SalL 180VRIDRAFGNVWTNIP.THLIGSMLQDGERLEVKIEALSDTVLELPFCKTFGEVDE.GQPLLYLNSRGRLA 

PhDUF62l76lYIDD.FGNVILNLENYEKPRTVELLDFNLR LPYLETYGLVEK . GEMLALPGSHDYLE 

SaDUF62 180LTVDH . FGNVQLAATGALLESLPRSLRVAHRP AVHARTFDDAPP . GGLLVHVDSAGLVA 



FIA 275IARNAASLAYPYHLKEGMSARVEAR 
NobA 276LARNAASLAYPYNLKAGISVQV 

SalL 248LGLNQSNFIEKWPVVPGDSITVSPRVPDSNLGPVLG 

PhDUF62232lAVNMGSAAERLNVKVGDELRVRLL 

SaDUF62237VAVNGGRAADLLAVTPGDQLRVTAG 



Figure S1. SAM-dependent haloginases and hydrolases of the DUF62 family. The residues highlighted in yellow are critical for SAM 
hydrolysis. The unique loops of fluorinase are highlighted in bright green. Accession numbers for DUF62 proteins are SalL from Salinispora 
tropica (ABP73643), PhDUF62 from Pyrococcus horikoshii 0J3 (NP_1 42440) and SaDUF62 from S. arenicola CNS-205 (ZP_01 648926). 
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Figure S2. Sequence comparison between FIA and NobA. The secondary structure of FIA (PDB No. 2V7V) is shown on the top. Critical 
residues (D16, D21, S23, W50, Y77, F156, N215, S269, R270 and A279) for SAM and fluoride binding are labeled by blue triangle). Residue 
S158 for fluoride desolvation is labeled by red pentagram. The distinct loop region (R192-L202, FIA numbering) which is bracketed by 
rectangle. 
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FIA 1 VAANSTRRPIIAFMSDLGTTDDSVAQCKGLMYSICPDVTWDVCHSMTPWDVEEGAR 

NobA lEFHMTTTNGRRPIIAFMSDLGITDDSVAQCKGLMLSVCPDVTIVDICHTMQPWDVEEGAR 

FlAl 1 MAANGSQRPIIAFMSDLGTTDDSVAQCKGLMHSICPGVTWDVCHSMTPWDVEEGAR 

F1A3 1 MSDLGTTDDSVAQCKGLMLSICPGVTIVDVNHSMTPWDVEEGAR 

FIA 58YIVDLPRFFPEGTVFATTTYPATGTTTRSVAVRIKQAAKGGARGQWAGSGAGFERAEGSY 
NobA 6 1 YIVDLPRLFPEGTVFATTTYPATGTTARSVALRI AHASKGGARGQWAGSGAGFERKEGSY 
FlAl 58YIVDLPRFFPEGTVFATTTYPATGTTTRSVAVRIRQAAKGGARGQWAGSGDGFERADGSY 
F1A3 45YIVDLPRFFPEGTVFATTTYPATGTATRSVALRIKQAAQGGARGQWAGSGAGFERAEGSY 

FIA 1 18IYIAPNNGLLTTVLEEHGYLEAYEVTSPKVIPEQPEPTFYSREMVAIPSAHLAAGFPLSE 
NobA121IYIAPNNGLLTTVIKEHGYLEAYEVSSPEVIPEQPEPTFYSREMVALPSAHLAAGFPLEK 
FlAl 1 18IYIAPNNGLLTTVLEEHGYIEAYEVTSTKVIPANPEPTFYSREMVAIPSAHLAAGFPLAE 
F1A3105IYIAPNNGLLTTVIEEHGYIEAYEVSNTKVIPAEPEPTFYSREMVAIPSAHLAAGFPLNE 



FIA 178VGRPLEDHEIVRFNPP . AVEQDGE^^ LVGWSAIDHPFGNVWTNIHRTDLEKAGIGYGARL 
NobA18lVGRRLADDEIVRFEBKDPELVADHELVGYVTNIDHPFGNVWTNIHRTDLEKLGVGYGTKL 
FlAl 178VGRRLDDSEIVRFHPP . AVEI SGEi^ LSGWTAIDHPFGNIWTNIHRTDLEKAGIGQGKHL 

F1A3 1 rf;vgralsddeivrfak Ipkpstvsggm lsgvitnidhpfgnlwtnihrtdlekagigyqtql 

FIA 237RLTLDGVLPFEAPLTPTFADAGEIGNIAIYLNSRGYLSIARNAASLAYPYHLKEGMSARV 
NobA24lRITLDGVLPFELPLSPTFADAGEIGAAVAYLSSRGYLALARNAASLAYPYNLKAGISVQV 
F1A1237KIILDDVLPFEAPLTPTFADAGAIGNIAFYLNSRGYLSLARNAASLAYPYNLKAGLKVRV 
F1A3225RLLLDGVLTFDLPLVPTFADAGQIGDPVIYINSRGYLALARNAAPLAYPYNLKAGLTVTV 

FIA 297EAR 
NobA30lKVG 
F1A1297EAR 
F1A3285TKA 

Figure S3. Sequence comparison between FIA, NobA, FIA1 and FIA3. Conserved residues are indicated by briglit yellow. The distinct loop 
region (R192-L202, FIA numbering) is highlighted by rectangle. FIA and NobA are from this study; FIA1 is from Streptomyces sp. MA-37 and 
FIA3 is from Actinoplanes sp. N902-10939. 
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account for these differences, particularly in the values for SAM. 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 
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This is an interesting article dealing with the characterization of a fluorinase from Norcardia brasiliensis. 
This research was obviously carried out independently and met publishing scientific standard, so it 
deserves publication in FlOOOResearch, although similar work has been published (Deng etal., 2014). 
This work mainly focused on the comparative studies on the fluorination of SAM to form 5'-FDA by two 
fluorinases NobA and FIA. Most of the experiments were well executed, except for the kinetic 
determination for these two enzymes. I suggest the authors reinvestigate the kinetic parameters, since it 
was not appropriate to determine the Km for NaF with a "maintaining" concentration of SAM at only 0.4 
mM (the author stated that Kms for SAM of FIA and NobA were 0.2 mM and 0.4 mM, respectively). A 
saturation concentration is normally around 5 X Km. 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 
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